# DPO Training

SummLlama3.2 3B

A text summarization model initialized from Llama3.2-3B-Instruct and optimized with DPO training on large-scale summarization feedback.

Large Language Model

ECE TW3 JRGL V5

ECE-TW3-JRGL-V5 is a model produced by merging MoMo-72B-lora-1.8.7-DPO and alpaca-dragon-72b-v1 with mergekit, with the aim of combining the strengths of both source models.

Large Language Model

© 2025 AIbase